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Abstract. We study the Circular and Jacobi /3-Ensembles and prove Gauss- 
ian fluctuations for the number of points in one or more intervals in the macro- 
scopic scaling limit. 



1. Introduction 

The circular (3 ensemble with n points, or C/3E„, was introduced by Dyson, [7], 
as a simplification of previous random matrix ensembles. It is a random process of 
n points distributed on the unit circle in C so that 

Kif) = ^ f f /(e* • ■ • , e*-)]A(e*S . . . , e^f d Al... d _pL (L1) 



Z n ,0 J-tt J -TV 2-7T 2TT 

for any symmetric function /. Here, A denotes the Vandermonde determinant, 

A(zi,...,z„) = JJ {Zk-zj), (1.2) 

l<j<fe<n 

and the partition function is given by 

_ r(l/3n+l) 

^ " [r(|/3 + !)]"' (L3) 

as shown in [9l[31]. This law also arises as the Gibbs measure for n identical charged 
particles confined to lie on the circle and interacting via the 2-dimensional Coulomb 
law. For this reason, it is also known as the log-gas. An important example of 
is as the distribution of eigenvalues of an n x n unitary matrix chosen at random 
according to Haar measure; this corresponds to (3 = 2. 

The second point process we will discuss is known as the Jacobi /3-ensemble with 
n-points. (We will abbreviate this name to J/3E„.) In this case, there are n-points 
on the interval [—2, 2] with joint probability density 



rt( Xl) ...,*») = -|A(.T l7 . . . ,x n )f [] (2 - x J ) a - 1 (2 + xjf- 1 , (1.4) 

3=1 

where the parameters a, b are positive real numbers. The partition function, Z, was 
determined by Selberg, [19] , The name Jacobi ensemble derives from the relation 
to the classical orthogonal polynomials of this name. 

In this paper, we determine the statistical behaviour of the number of particles 
in one or more intervals as n — > oo with the length of the intervals remaining fixed 
as n — > oo. This is what may be termed macroscopic statistics; the microscopic 
regime, where the length of the intervals scale as —, is much more difficult and will 
not be addressed. 
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The statement for the Jacobi ensemble is the cleaner of the two: 

Theorem 1.1. Given a sample from J(3E n , let N n {9) denote the number of points 
that lie in the interval [2 cos(#), 2] . Then for any distinct 0%, . . . , 6j G (0, it), 

y/^fcWnM-nBj], je{l,...,J} 

converge to independent Gaussian random variables with mean and variance 1 as 
n — ► oo. 

For the circular ensemble, we need one preliminary. Let us write $ for the 
zero-mean Gaussian process on (— 7r,7r) with covariance 



£{* (0i)*(0 2 )} = 



1 : e l = 9 2 

that is, ^(0) represents an independent Gaussian associated to each point in this 



arc. 



Theorem 1.2. Let us fix 6\ < 62 < • ■ ■ < 0j £ (— 7r,7r) and write N(a,b) for the 
number of points in a sample from Cf3E n that lie in the arc between a and b. Then 
the joint distribution of 



^§l[N n {e 3 A)-n e -^l j<l 
converges to that of ty(0{) — ty(9j) as n — ► 00. 
In the special case of a single arc, we see that 

(1.5) 

converges to Gaussian random variable of mean zero and variance one. 

For /3 6 {1,2,4}, Theorem II .21 was proved by Costin and Lebowitz, [2], At these 
three temperatures, the model becomes exactly soluble in the sense that there are 
explicit determinantal formulae for the correlation functions; see [5] or [IB]. There 
are many other works on Gaussian fluctuations at these three temperatures; in 
particular, we would like to draw the reader's attention to [3j |H [Til E2 H21 H3 [HI 
I5U] , One of the questions addressed in these papers are the laws of other linear 
statistics. Given a point process {xj}, say on R, and a function / : K — > R, the 
associated linear statistic is 

3 

In particular, the number of points in an interval I corresponds to / = Xi- 

For the circular problem with (3 — 2, the behaviour of linear statistics can be 
recast as a question about the asymptotics of Toeplitz determinants. In this case, 
we may attribute the proof of asymptotically Gaussian fluctuations to Szego, [25j . 
at least for / e C 1+e . Of the numerous papers devoted to Szego's Theorem in 
the last half-century, one stands out for its adaptability to the case of general /3, 
namely [TU] . This paper proves asymptotically Gaussian fluctuations for C/3E with 
/ G C 1+e . A subsequent paper, [TTJ, covers ensembles on the real line. 

The approach taken here is to use matrix models for C/3E„ and J/3E n that where 
described in |14j . This paper extended work of Dumitriu and Edelman, |5j, who 
discovered tri-diagonal matrix models the /3-Hermite and /3-Laguerre ensembles. 
By 'matrix model' we mean an ensemble of random matrices whose eigenvalues 
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follow the desired law. We should also mention [27] as the progenitor of both these 
papers. 

In [Sj, Dumitriu and Edelman proved Gaussian linear statistics for polynomials 
by studying traces of powers of their matrix models. 

In [15] , the circular matrix models where studied in the microscopic scaling limit 
using a Prufer variables approach. Having finished that paper, it dawned upon me 
that the use of Prufer variables leads to a very simple treatment of the problem of 
the number of particles in an interval. This is what is presented here. 

The virtue of using Prufer variables to treat the microscopic scaling limit of the 
matrix models was discovered independently by Valko and Virag, [29j . Specifically, 
they study the eigenvalue statistics of the /3-Hermite ensemble in a neighbourhood 
of a reference energy lying in the bulk. Earlier, [18] studied the statistics in a 
neighbourhood of the edge using the Riccati transformation. 

Notation. While dealing with the Jacobi ensembles, we will write X < Y to 
indicate that X < CY for some constant C. In all cases, the implicit constant will 
depend only on /?, a, and b. 

Acknowledgements. The author was supported in part, by NSF grant DMS- 
0401277 and a Sloan Foundation Fellowship. He is also grateful to the Institute for 
Advanced Study (Princeton) for its hospitality. 

2. Background 

2.1. A change of variables. In this subsection, we recount the results of p3 . This 
paper describes probability distributions on discrete measures with the property 
that the marginal distribution of the location of the mass points happen to have 
the laws that interest us. The virtue of the random measures described in [14] 
appears when one looks at them from the point of view of orthogonal polynomials. 

Given a probability measure dfi on the unit circle in C that is supported at 
exactly N points, we may write 

J fdp = J2»jf(e ie n- (2-1) 

Applying the Gram-Schmidt procedure to {1, z, . . . , z N ~ x } leads to an orthogonal 
basis for L 2 (dfi) built of monic polynomials. We write &o(z) = 1, . . . , $n-i(z) for 
these polynomials. 

As discovered by Szego, these polynomials obey a recurrence relation, 

3 fc+ i(*) = z$ fc (z)-ttfc*;(*) 

<$>i +1 {z) = <$>* k {z) - a k z<$> k {z) ( ' j 
with $Q — = 1< Here denotes the reversed polynomial, 

= z k M^), (2.3) 

and ocq , . . . , ctN-2 are recurrence coefficients belonging to D. Expanding z N in this 
basis shows that 

z<f> N - 1 {z)-e- lri <I>* N _ 1 (z) = mL 2 (dn) (2.4) 

for a unique r\ € [0, 27r). A basic fact in the theory of orthogonal polynomials is 
that the mapping 

{(6»i,/^i), . . . , {6 n ,^n)} i-> (ao, ■ ■ ■ ,a N -2,e~"') 
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is a bijection. For a proof of this (and the other claims in this subsection), see 
[20] or [26]. As in the former reference, we will refer to ao, . . . , ajv-2, e~'" ) as the 
Verblunsky parameters of d\i. 

It will be expedient for us to encode (|2 . 2|) in a different way. As the zeros of the 
orthogonal polynomials lie inside the unit disk, B k {z) = z$fe(z)/$^(z) is a Blaschke 
product of degree k + 1; moreover, 

B (z) = z, B k+l (z) = zB k (z) \~ * kBk{ t ] (2.5) 

1 - a k B k (z) 

Lastly, by Q2.4|) . we have supp(c?^) = {z : B N _i(z) — e - "'}. 

We have two systems of coordinates for the set of probability measures on S 1 
with exactly N mass points, on the one hand, the location and weight of the masses, 
on the other, the Verblunsky coefficients. Choosing random Verblunsky coefficients 
gives rise to a random measure and thus to a random subset of the unit circle. 
The good news is that they will be chosen independently. In the model for the the 
circular ensembles, they will follow a O distribution: 

Definition 2.1. A complex random variable, X, with values in the unit disk, D, 
is 0„-distributed (for v > 1) if 

E{f(X)} = & [[ f(z)(l M 2 ) ( ^ 3)/2 d 2 z. (2.6) 
J Jo 

Simple computations show E{X} = 0, E{|V| 2 } = ^ and E{|V| 4 } = (iy+1) 8 ( „ +3) ■ 
In the Jacobi ensemble, they will be Beta-distributed: 

Definition 2.2. A real- valued random variable X is said to be Beta-distributed 
with parameters s, t > 0, which we denote by X ~ B(s, i), if 

n.f(X)} = 21 l^ut^ t /(*)(! - + dx - ( 2 - 7 ) 



T{s)T{t) 



Note that E{X} = |=f and E{V 2 } = j^j^g . 

The combined content of Theorem 1.2 and Proposition B.2 from [14] is: 

Theorem 2.3. Given [3 > 0, ie< a k ~ ) 3(/ c+ i) + i fee independent random variables 
and let e tr) be independent and uniformly distributed on S . If one forms B n -i(z) 
according to (|2.5p . then 

{z : B n -t(z) = e-*>} 
is distributed according to the C(3E n ensemble. 

The result for the Jacobi model is similar; however, we need one extra ingredient. 
A probability measure on S 1 is invariant under complex conjugation, 

f(z)dfi{z) = J f(z)d(i(z), 

if and only if it has real Verblunsky coefficients. From Theorem 1.5 and Proposi- 
tion B.2 of [14], we have 



Theorem 2.4. Given [3 > 0, let a k be independent and distributed as follows 

(B($p + a,$l3 + b) k even, 

' i B(^f3 + a + b,^f3) k odd 
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Then the n points 

{z + z : B 2n _i{z) = -1} (2.9) 
are distributed according to the the Jacobi ensemble (| 1 -4[) . 

2.2. The basic processes. In the previous subsection, we showed how one could 
encode the f3 ensembles as random Blaschke products built from independent ran- 
dom variables through a simple recurrence, (|2.5p . In this subsection, we will take 
matters a few steps further; a similar path was followed in |15j . 

From Theorems l2.3l and l2.4l we see that we need only study the Blaschke products 
on the boundary of the unit circle. To this end, let us introduce random continuous 
functions ip k : (-7r,7r) R via B k (e w ) = e l ^ k( - e \ We break the 2ttZ ambiguity by 
choosing a branch for the logarithm in the recurrence relation (|2.5j) . Specifically, 
we define 

M8) = 6> 4>k+i{0)=M0)+0 + T(^ k (0),a k ), (2.10) 

where 

oo 

T(ip,a) :=-2Imlog[l-ae^] = Im^fe^V. (2.11) 

1=1 

As the argument of a Blaschke product, ip k is an increasing function of 9. 

Readers wishing to compare this with [TS] should note that here ip k (6) denotes 
the true Priifer phase rather than the relative Priifer phase. 

To motivate what follows, let us give a quick description of how we will prove 
Gaussian fluctuations in the circular case. The Jacobi case is analogous. 

If we choose 77, a k to be distributed as in Theorem [231 and use these to produce 
a sequence of increasing functions ip k (8) as in (|2.10|) . then the set of points 

{e ie : ipn-l(0) € 2irZ + V } 

will be distributed according to the C/3E„. In particular, the number of points lying 
in the arc [a, b] C (— tt, it) is approximately ^[ip n _i(b) — VVi-i(a)]; indeed the error 
is plus or minus one. In this way, it suffices to show that asymptotically, ip n _i(b) 
and ip n —i(a) follow a joint normal law. The error of ±1 will drop out in the limit 
when we divide by the square root of the variance. 

In the circular case, ip k (9) — (fc + 1)9 is a sum of independent random variables 
(because a k have rotationally invariant laws) and so it is easy to demonstrate that 
it has Gaussian behaviour. To study the joint distribution of ip n -i at several values 
of 9 takes a little more care, but can be dealt with using the Central Limit Theorem 
for Martingales. Implementing this argument requires a few estimates on T, which 
we record here. 

Lemma 2.5. Suppose 4>, -0 G R and a ~ 0^. If T(ip,a) = 2Im{e"^a} ; then 



E{T(0,a)} =E{f(V,a)} = 0, (2.12) 

E{T (0, a)T(6 a)} = ^ cos(^ - cj>), (2.13) 

E{f(^,a) 4 }= E^fegj , (2-14) 

E{|T(^,a)~f(^,a)| 2 } < (v+ * v+3) ■ (2.15) 
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Lastly, combining (|2. 13|) and ()2.15p shows 

E{|T(V.,a)| 2 } < I 4 IJ . (2.16) 

Proof. The vanishing of E{T(?/>, a)} merely uses the fact that a follows a rotation- 
ally invariant law. Specifically, for any < r < 1, 

Aog[l-r e W]£ = 
Jo 

by the Mean Value Principle for harmonic functions. The second identity in (|2.12[) 
follows in the same manner. 

To prove (|2.13[) we simply compute: 

E{f(tp,a)f((f),a)} = [ [ r 2 sin(6> + V>) sin(0 + <£) (1 - r 2 ) {lJ - 3)/2 r dr d9 

T J Jn 

Similarly, 

E{f(?A,a) 4 } = i^il // ^si^^ + ^^-f 2 )^ 3 '^*^ (2.17) 

w 7 Jd 

(2.18) 



= 48 

(i/+l)(i/+3)- 



Applying Plancharel's theorem to the power series formula for T gives 

oo 

EflTty.a) - T(^,a)| 2 } = £ £E{|a| 2/ } < 2(£ - l)E{|a| 4 }. (2.19) 

1=2 

Noting that ^ < 2 and E{|a| 4 } = (ly+1) 8 (iy+3 ) proves (|2~T5)) . □ 

The analogue of Lemma 12.51 for the Jacobi case is 

Lemma 2.6. Let T(ip, a) = 2[a — E(a)] sin(-0) with a ~ B(s,t) . Then 

E{TC0,a)} = O (2.20) 
E{f(i>, a)f(4>, a)} = {s+ty ^ +t+1} [cos(V - 0) - cos(^ + 0)] (2.21) 

for any </>, ^ £ K. Moreover, if a% is distributed as in (|2.8p . i/ien 

E{f(^,a fc ) 4 }<(fc + 1)- 2 (2.22) 

E{|T(V>, a k ) - 2a fc sin(^) - aj| sin(2^)|} < (k + 1)~ 2 (2.23) 

IE{im^)|}<(fc + ir 1/2 (2.24) 
where the implicit constant does not depend on k, merely (3, a, and b. 

Proof. Equation (|2 . 20[) is immediate from the definition, while (|2.21[) follows from 
(|A.5|) and 2sin(?/>) sin(0) = cos(V> — </>) — cos(?/> — 4>)- 
To obtain (|2.22|) . we can estimate rather crudely: 

E{f(^a fe ) 4 }<2 8 E{ a 4 }<(A : 2 + l)- 1 



GAUSSIAN FLUCTUATIONS FOR ENSEMBLES. 



7 



using (|A.4j) . Next we prove ()2.23|) . It follows from combining 



LHS(GL23|) = E 



OO ^ s OC 

;=3 ' ^m=2 



2 l a 2m 



2m+l 



m=2 

< |E{a 3 }|+2E<f-a 2 log[l-a 2 ] 



with (|A.3|) and Lemma [A. 11 

The last estimate follows from (|2.23[) and (|A.2p by applying the triangle and 
Cauchy-Schwarz inequalities. □ 



On several occasions we will make use of 

Lemma 2.7. Given real valued sequences e^, Xk, Yk with Xk+i = Xk + 5 + Yfe for 
some 5 £ (0, 2ir), 



E 



< 



2 ll e fcll-e° o + ll £ fe _ efe-ill^ 1 + Ikfe^felU 1 



\l-e i5 \ 

Proof. Using summation by parts and other elementary manipulations, 



n-l 



k=0 



oi5 



k=0 



"- 1 e iX k _ e iX k+1 "pj e iX k + 1 _ e i(X k +8) 

E efc — — + z2 ek ~ 

k=0 k=0 



c JX n , „iX , 



1 - e i5 

p i(X k +8)\ p iY k 



1 - e i<5 



l_ e « +2^ efe 1 



G i5 



fe=l 



fc=0 



This implies (|2.25p in a most obvious way. 



(2.25) 



□ 



2.3. Martingale Central Limit Theorem. In both the Circular and Jacobi 
cases, we will approximate ip m -i(0) by 



m— 1 



S(m,6) = E T(^ fc (0),a fc ). 



(2.26) 



fc=0 



The recursive definition of ^ and the fact that E{T} = shows that this is a 
martingale for the sigma algebras 

M m := o-(a , • • ■ , a m _i), 

that is, the smallest sigma algebra generated by these random variables. In the 
circular case, there is no need to pass from T to T in order to obtain a martingale; 
nevertheless, doing so simplifies the proof just a little. 
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Proposition 2.8. Fix 9± 



, 9 j distinct. Suppose that as n — > oo 



p-pr^E^^fc^J.afcjf^fc^O.a*)!^*}-^^ znL 1 sense (2.27) 



n-l 



1 



-^7- E ^TO^ W)| 4 } - 
lo § M fcO 

^Tfe(y, at )-t(4ft-),ai.) 



:E 



/or all 1 < j,l < J . Then in the random variables 
ipn-i(9j) - n9j 



(2.28) 
(2.29) 

(2.30) 



j e {1,2,..., J}, 

Vlog(n) 

converge to independent Gaussians of mean zero and variance a 2 . 

Proof. The first two conditions are far more than is really needed to apply the 
Martingale Central Limit Theorem to the sequence of random vectors 



■(S(n,( 



• S(n,9j)); 



see [TTIHB], for example. These references present the proof for scalar martingales 
- a small extension of the usual Central Limit Theorem — which can then be 
applied to any linear combination of the components of the vector. In this way, we 
see that 

-j±= (S(n, 9i), S(n, 9j)) N(0, a 2 8 ol ) 

in distribution. This extends to (|2.30|) because (|2.29p says that the difference be- 
tween these to vectors converges to in L 1 sense. □ 



3. Circular case 

In this section, we proof Theorem 11.21 The discussion in the previous section 
explains why this amounts to checking the hypotheses in Proposition ^. 81 We begin 
with a recap. 

Let a/c ~ / 3(fe + i)_|_i be chosen independently (as in Thcorcm l2.3p and then form 
the process ip n -i{9) by solving the recurrence (|2.10p . If rj is chosen independently 
from [0, 2tt) according to the uniform distribuion, then 

{e*:Vn-i($ + »7€27rZ} 

is distributed according to C/3E„. Notice that the number of points in the arc 
[a, 6] C (— 7r, 7r) differs from J^[ip n -x(b)— V'n-lfa)] by at most ±1. Thus Theorem ll.2l 
will follow once we show that 

7 i m [^{6)-n9]^^m 

in distribution. This in turn follows from Proposition 12.81 once we check its hy- 
potheses. The two lemmas that follow verify these conditions in the order they 
appear there. 
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Lemma 3.1. Given 81,62 £ (~ir,n), 



1 



log(n) 



]T E{T(V>*(0i), a k )T^ k {8 2 ), a k )\M k } 



k=0 



? :9l - d2 (3.1) 
:8 1 ^8 2 



in L 1 sense. 
Proof. By (j2J3j) . 



1 71—1 



(3.2) 



k=0 



/3(fc+l)+2' 



which immediately settles the case 8\ = 82. By symmetry, this leaves us only to 
treat the case 8\ > 82- We do this using Lemma 12.71 with 

X k =M9i)-M02), Y k =r^ k (8 1 ),a k )-r(ip k (6 2 ),a k ), e k - 
and S = 81 — 8 2 ■ Indeed, we may deduce 

e{ RHSGUI) } =0(l/log(n)) 
since by (l2~13)l . E{|F fc |} < 8[(3(k + 1) + 2)]" 1 / 2 . 
Lemma 3.2. Given 8 £ (— ir, ir), 



n— 1 

EE{|t(^(0),a fc )| 4 }-O 
lo S (") fco 



log(n) 



E 



2 ( 

n-1 



£Tty*(0),a fc ) -Tty fc (0), 



fc=0 



□ 

(3.3) 
(3.4) 



as n -> 00. 



Proof. From (|2.14[) , we see that not only does LHS (|3.3[ ) converge to zero, it is 
0(1/ log 2 (n)). We turn now to ||0|) . By (|2~T2|) and (|2~15| we have 

n — 1 

LHS(E1 = T — — EE{|T(^(^),a fe )- f (^ fc (^),afc)| a } 



log(n) ^ 



n-1 



< 



1 " * 
— V 



16 



log(n) ^ [/3(fc + l) + 2][/3(fc + l)+4]- 



which is 0(l/log(n)). Note that (|3.4[) implies (|2.29[) via the Cauchy-Schwarz in- 
equality. □ 

4. Jacobi case 

In the Jacobi case we choose a k to be distributed as in Theorem 12.41 For then 

{2cos((9) : -ir £ 2ttZ, 6>e(0,7r)} 

is distributed according to J/3E„ with parameters a and 6. As the Verblunsky 
coefficients are real- valued, ip k (0) = 0. Thus 

|Ar n (0)-^n-i(0)l<i 

with probability one. Recall from the introduction that N n (8) denotes the number 
of particles in the interval [2 cos(#) , 2] . 
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In light of this discussion, we see that in order to prove Theorem ll.il we need 
only show that for any distinct 9i, . . . ,0j £ (0, ir), 



converge to independent Gaussian random variables with mean and variance 1 
as n — ► 00. This in turn can be effected via Proposition 12.81 provided we verify its 
hypotheses. That is precisely what we will do. 



Lemma 4.1. Given 6\, 02 £ (0, w), 



1 



n-l 



log(n) 



E{T(^ fc (6i), a fc )T(^ fc (0 2 ), a fc )|7W fc } 



fc=0 



/9 




n = c-2 



(4.1) 



m L 1 sense. 
Proof. By (|2~211 . 



n-l 



1 " * 

LHSgTj) = — — V e k [cos(M&i) ~ Mh)) ~ cos(M0i) + Vfc (&))], (4-2) 
log(n) 



where 



4(fc/3+4a)(fc,3+4b) 



A: even 



(kp+2a+2b)' A (fc/3+2a+2b+2) 

4[(fc-l) / 3+4n+4fc](fc+l)/3 . , , , 
(fc / 3+2a+2&) 2 (fc/3+2a+2b+2) 1 * UUU ' 

Applying Lemma 12.71 with 

X k =M9i)+M02), Y k = Tty k (0 1 ),a k ) +r{il> k (0 2 ),a k ), 
and 6 = 0i + 02, then using (|2.24p , shows that 

n— 1 

LHSgU) = — -vY)efcCos(^ fc (fli)-Vfc(fe)) +0(l/log(n)). 
log(n) 



fc=0 



This renders (|4.1[) in the same manner as in Lemma |3. 11 
Lemma 4.2. Given £ (0,7r), 

-. n— 1 

— S— ^E{|f(^(0),a fc )| 4 }-O 
log (n) ^ 



\/^og(n) 



:E 



fc=0 



(4.3) 
□ 

(4.4) 
(4.5) 



as n — > co. 



Proof. By (|2.22| . not only does LHS ([44|) converge to zero, it is 0(l/log 2 (n)). 
We turn now to (03]). By ([2T2"5|) , 

n-l 



LHSE31) = 



1 



Vlog ( J- 



:E 



71 — X \ 

E 2E W sin(#(0)) - a\ sin(2^ fc (0)) j+o(-^±_). 



Lemma |2"771 with 6 = 0, X k = ipk{@), and e k = E{a k } shows that 



^2E{a fe }sin(^(0)) 



fc=0 



= 0(1) 
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and in a similar way, 



E 



^E{a 2 fe } sin (2^(0)) 



k=0 



0(1). 



In view of these estimates, (14.51) follows from 

n— 1 2^ n — 1 



E 



J> 2 -E{a 2 }] sin(2^(0)) - £ fi{ [a 2 - E{a 2 }} 2 sin 2 (2^(0))} 

^ k=0 

<J]E{ a J} = 0(l). 

The last deduction is based on (|A.4[) . 



fe=0 



□ 



Appendix A. Some integrals of Beta random variables 
As seen in (12.71) above, 



I(s,t) 



(l-x) s - 1 (l + x) t - 1 dx 



ys+t-l 



mm. 



r( s + t) 



indeed this is the famous Beta integral of Euler. In this appendix, we record a few 
simple computations that were needed in the text; nothing is novel. 
By the binomial theorem, 



fe = 2- fe ^(-l)"M \{l-x) m {l + x) 



\k — m 



and so if X ~ B(s,t), then 



E{*n = £(-irP) 

m=0 ^ ' 



k \ I(s + to, t + k — m) 
mj I{s,t) 



With a little algebra one then obtains 
t-s 



E{X} 
E{X 2 } 
E{X 3 } 
E{X 4 } 
In particular, 



t + s 

(t - s) 2 + {t + s) 
(t + s)(t + s+l) 
(t - s)[{t - s) 2 + 3{t + s) + 2] 
(s + t)(l + s + t){s + t + 2) 
(t - s) 2 [(t - s) 2 + 6{s + t) + 8] + 3(t + s) 2 + 6(t + s) 
{s + t)(l + s + t)(s + t + 2)(s + t + 3) 



E{(X -E{X}) 2 } 



Ast 



(t + s) 2 (t + s + l)' 



(A.l) 
(A.2) 
(A.3) 
(A.4) 

(A.5) 
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Differentiating I(s, t) with respect to s or t gives access to expectations of loga- 
rithms. For example, 

(d s + d t )[I(s + l,t+l) + I(s,t)] 



E{-X 2 log[l - X 2 }} = 



I(s,t) 



(s -t) 2 + s + t 



[2* (s + t) - - *(s) - 2 log(2)] 



(s + *)(l + s + t) 

(s + f)(s-t) 2 + £ 2 + .s 2 
+ (s + t) 2 (l + s + i) 2 ■ 
Where \&(x) := cirlogorfx) is the digamma function. 
Lemma A.l. Given s,t G (e, oo) obeying \s — t\ < 8 and X ~ -B(s, i), 

E{-X 2 log[l~X 2 ]}<( S + i)- 2 
where the implicit constant depends only on e erne? <5. 
Proof. In light of the calculation above, we need only prove 

|2*(s + 1) - *(*) - *(s) - 21og(2)| < (s + i)- 1 . 

This in turn can be deduced from ty(x) — log(a;) + 0(x^ 1 ) as x — > c 
weak form of Stirlings formula. Indeed, 

2 log(s + i) - log(t) - log(s) - 2 log(2) = log 

which is < (s + t)~ 2 . 



(A.6) 



= log 



a very 



□ 



[2. 

p: 

K 

[S. 

[(>. 
p. 
p: 

[9 
[10 

[11 
[12: 
[is: 
[14: 
[is: 
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